ON DISTRIBUTION OF THREE-TERM ARITHMETIC 
PROGRESSIONS IN SPARSE SUBSETS OF F™ 



HOI H. NGUYEN 



Abstract. We prove a structural version of Szemeredi's regularity lemma for 
subsets of a typical random set in F™ . As an application, we give a short proof 
for an analog of a hard theorem by Kohayakawa, Luczak, and Rodl on the 
distribution of three-term arithmetic progressions in sparse sets. 



1. Introduction 

Let G be a graph and let A, B be two subsets of Vq. We define the density d(A, B) 
of G(A, B) to be 

d(A,B) :=e{A,B)/\A\\B\. 
Let e be a positive constant. We say that the pair (A, B) is e-regular if 

\d(A',B')-d(A,B)\ < e 

for any A' C A and B' C B satisfying \A'\ > e\A\ and \B'\ > e\B\. 

Szemeredi's regularity lemma, a fundamental result in combinatorics, states that 
the vertex set of any dense graph can be partitioned into not-too-small pieces so 
that almost all pairs of pieces are regular. 

Theorem 1.1 (Szemeredi's regularity lemma). Let e > 0. There exists M = M(e) 
such that the vertex set can be partitioned into 1/e < m < M sets Vi with sizes 
differing by at most 1, such that at least (1 —e)ra 2 of the pairs (Vi, Vj) are e-regular. 



Consider a vector space V — F™, where p is a fixed odd prime and n is a large 
integer. Let A be a subset of V, we define the (bipartite, directed) Cayley graph 
generated by A to be Ga = G(Vi,V2), where Vi,V^ are two copies of V, and 
(v!,v 2 ) G E(G A ) if v 2 - «i G A. 

It is clear that Ga is a regular graph of degree \A\. Hence if A is dense enough, 
then Szemeredi's regularity lemma is applicable to Ga- Furthermore, since Ga 
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has additional algebraic structure, it is natural to expect a stronger result than 
Theorem 1.1. Indeed, a result of Green [1, Section 9] confirms this intuition: 

Assume that \A\ = fl(\V\). Then one can partition V(Ga) into affine subspaces of 
large dimension and so that almost all pairs of subspaces are e -regular. 

Szemeredi's regularity lemma is not meaningful for sparse graphs in general. How- 
ever, it can be extended to certain graph families. Let £ be a positive constant. We 
say that the pair (A,B) is relatively e-regularii 

\d(A', B') — d(A, B)\ <ed(G) 
for any A' C A and B' C B satisfying \A'\ > e\A\ and \B'\ > e\B\. 
Let be given b > 2 and a > 0. We say that a graph G is (b, a)-sparse if 

d(X,Y) < bd(G) 

for any \X\ > o-\Vq\ and \Y\ > a\Vc\- The following result extends Szemereredi' s 
regularity lemma for (6, cr)-sparse graphs. 

Theorem 1.2 (Szemeredi's regularity lemma for sparse graphs, [2, Lemma 4]). 
Let b > 0. For e > there exists a — o~(b, e) such that the following holds for 
all (b,o~)-sparse graphs. There exists M — M(e,b) such that the vertex set can be 
partitioned into 1/e < m < M sets Vi with sizes differing by at most 1, such that 
at least (1 — e)m 2 of the pairs (V., Vj) are relatively e-regular. 

As to how Theorem 1.2 extends Theorem 1.1, our first goal is to point out that 
the result of Green can be extended easily to "(6, cr)-sparse" Cayley graphs in F": 

Assume that A is not too sparse, and Ga is (b,a)-sparse with some reasonable 
constants b, a . Then Theorem 1.2 is applicable to Ga in such a way that the vertex 
partitions can be taken to be affine subspaces of high dimension. 



We shall give a precise statement in Section 3. Next, let Z be an additive group 
and let a be a positive constant. We say that a subset R of Z is (a, 3AP)-dense 
if any subset of A of cardinality at least a\A\ must contain a nontrivial three-term 
arithmetic progression in Z . It has been shown in [2] that almost every subset of 
cardinality 3> a \Z\ X ^ 2 of the cyclic group Z = Z„, where n is odd, is (a, 3^4P)-dense. 
Our next goal is to prove a similar result. 

Theorem 1.3 (Kohayakawa-Luczak-Rodl theorem for F"). There exists a constant 
C = C(a) such that the following holds for all r > C(a)\V\ 1 ^ 2 . Let R be a random 
subset of size r o/F™, then the probability that R fails to be (a, 3AP)-dense is o(l). 
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To prove Theorem 1.3 we follow the approach of [2]. However, with our structure 
result in hand (Theorem 3.1), we are able to get around many technical difficulties 
to provide a much simpler proof. 

2. Notation 

Fourier transform. (cf. [4, Chapter 4.]) Let H be a subspace of V, let / be a 
real-valued function defined on V. Then the Fourier transform of / with respect to 
H is 

f(0 :=H xeH f(x)e(-{x,0) 
Where (x,£> = £? =1 x^/p, and e(z) = e 2 ™. 

Convolution. Let / and g be two real-valued functions defined on V. The convo- 
lution of / and g with respect to H is 

/ * g(h) := E xeH f(x)g(h - x). 
The following basic properties for real- valued functions will be used several times. 
. (Parseval's identity) E xeH f 2 (x) = J2 ( eH l/(0| 2 - 

• (Plancherel's formula) E xeH f(x)g(x) = X^ eJ? /(£)?(£)• 

• (Fourier inversion formula) f(x) = X^eff /(£) e (( x i 0)- 

Let A be a subset of V, and let v be an element of V. We define A V H to be the set 
A + v n H. Sometimes we also write A V H as its characteristic function. Following 
are some simple properties: 

. 3^(0 = 1^1/1^1 if£ei^ ; 

. = e((v-v',0)^(0 ifv-v'€H; in particular, \A%(£)\ = 

e- regular vector. Let e be a positive constant. Let A be a given set. We say that 
a vector v is an ^-regular vector with respect to H if 



sup \A^)\<e\A\/\V\. 
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(It is more natural to use the upper bound e|A^|/|i?| in the definition above, but 
we find our definition more convenient to use, and e|A|/|V"| is the typical value for 
e\A" H \/\H\.) 

Notice that if v is an e-regular vector, then so is any element of v + H. 

We say that a subspace H is e-regular for A if the number of v's which fail to be 
e-regular is at most e\V\. 

cr-regular set. Let a be a positive constant. We say that a subset R of V is 
cr-regular if the number of edges between X and Y in the Cayley graph Gr is as 
many as expected, 

e GR (X,Y) = (l + 0rT (l))\R\\X\\Y\/\V\, 
provided that \X\,\Y\>aN. 

Roughly speaking, a typical large random set is cr-regular for quite small cr (see 
Subsection 4.1). In particular, the set V itself is cr-regular for all a. 

Dependency of constants. We shall work with several constants throughout this 
note, so let us mention briefly here to avoid confusion. 



a, c(a) -^ri^fE^fG^fC 

First, a is the constant that we fix all the time. The constants c(a)'s depend only on 
a; these constants will appear as exponents in Section 5. Next, r\ will be chosen to 
be small enough depending on a and the c(a)'s. The constant e will be considered 
as an arbitrary constant in Section 3 and Subsection 4.1, but it will depend on a 
and rj in later sections. Last but not least, a depends on a and e. We shall choose 
rj, e, cr to be small enough, while constants C — C(a, r/, e, a) are often very large. 

Tower-type function. We shall use a tower-type function W(t) defined recur- 
sively by W(l) = 2p and W(t) = (2p) w ^-^ for t > 2. 

The note is organized as follows. In Section 3 we discuss about Green's result 
for sparse Cayley graphs. Next in Section 4 we shall provide some ingredients for 
applications. The proof of Theorem 1.3 is established in Section 5. 



3. Green's regularity lemma for (6, ct)-sparse Cayley graphs 

In this section, unless otherwise specified, all Fourier transforms and convolutions 
are taken with respect to an underlying subspace H. For short, we let N = \V\. 
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Theorem 3.1. Let a,e G (0,1). There is a constant a = a{e,a) such that the 
following holds. Let R be a a-regular set of V and let A be a subset of R of 
cardinality a\R\. Then there is a subspace H < V of index at most W (4(ea)~ 2 ) 
which is e -regular for A. 



We pause to discuss the strength of Theorem 3.1. First, since R is cr-regular, the 
Cayley graph Ga generated by A is (2/a, cr)-sparse. Indeed, for any X, Y G V such 
that \X\ > a\V\ and \Y\ > \V\ we have e GA (X,Y) < e Gn {X,Y). On the other 
hand, since R is cr-regular, we have 



Now, since Ga is (2/a, cr)-sparse, Theorem 1.2 is applicable to Ga- The advantage 
of Theorem 3.1 is, besides implying Theorem 1.2, it also provides a well-structured 
partition for the vertex set of Ga as follows. 

Let V = yjf =l Hi be the partition of V into affinc translates of H. Let v\, . . . , vk 
be representatives of the coset subgroups V/H. Then by definition, all but at most 
sK vectors v\, . . . , Vk are e-regular vectors with respect to H. 

Next assume that Hi = Vi + H and Hj = Vj+H are two afBne translates of H such 
that vj — Vi is an e-regular vector. We will show that the subgraph Ga{Hi, Hj) is 
relatively e 1 / 3 -regular. 



e GR (X,Y) = (l + 0ff (l))|ii||X||y|/7V 



< 2\R\\X\\Y\/N 

< (2/a)\A\\X\\Y\/N 
= (2/a)\X\\Y\d(G A ). 



Hence, d GA (X,Y) < {2/a)d{G A ). 



It is clear that e GA (Hi,Hj) = \H\\A 



H I' 



thus 



d GA (H i ,H j ) = \A 



Vj —Vi 

H 



\I\H\. 



Let X C Hi and Y C Hj be any two subsets of Hi and Hj respectively, which 
satisfy \X\, \Y\ > e 1 / 3 '\H\. We shall estimate the number of edges generated by X 
and Y. We have 



6 



HOI H. NGUYEN 



e G jX,Y) = ]T A(y-x)X(x)Y(y) 

xeHi,y£Hj 

= E A T Vi (y' -x')X{x' + Vi )Y(y> + Vj ) 

x',y'eH 

= E AT V \y'-x')(X-v t )(x')(Y^v,)(y') 

x',y'eH 

Now we apply the Fourier inversion formula to the last sum, 



e GA (X,Y) = |ff| 2 £^ B< (0(*-«i)(-0(5^)(0 

= \AT Vi \\X\\Y\/H + E 
ee//\{o} 

Since Vj — Vi is an e-regular vector with respect to H, we infer that 



eG ,(X,F) - \AT V *\\X\\Y\/\H\\ < ( £ \A^-^\/N)^\(X - Vi )(-0(Y -vM) 



By Parseval's identity and by the Cauchy-Schwarz inequality we thus have 



e GA (X,Y) \A%- Vt \\X\\Y\/\H\\ < \H\{e\A^\/N){\X\\Y\)^ 

< e\A v ^ Vi \\H\ 2 /N. 



It follows that 



\d GA {X,Y)-d GA {HuHj)\<e\A^\\H¥/{\X\\Y\N) 

< e 1/3 \A v i~ v *\/N 
= e^d{G A ). 



Hence GA(Hi, Hj) is indeed relatively e 1 / 3 -regular. One observes that Vj — vi is an 
e-regular vector for all but at most eK 2 pairs (i, j). Hence the partition V = LifL 1 Hi 
satisfies the conclusion of Theorem 1.2. 



Another crucial observation, which will be used later on in applications, is that 
the definition of e-rcgular vector works for any type of (linear) Cayley graph. For 
instance assume that (vi + V2)/2 is an e-regular vector with respect to H and 
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define a bipartite Cayley graph G' A on (H — vi,H — v 2 ) by connecting (hi — v\) 
with (h 2 - v 2 ) if ((hi - vi) + (h 2 - v 2 ))/2 = (hi + h 2 ) - (vi + v 2 )/2 G A. Then this 
graph G' A is also e 1 / 3 -regular. To justify this fact, the reader just needs to follow 
the same lines of verification used for G a above. 

Now we start to prove Theorem 3.1. 

Proof . Define d(A, H) by 



d(A,H) := 




Observe that d(A, H) is the mean of the squares of the normalized densities of the 
GA(Hi, HjYs. We show that this quantity is always bounded. 

Claim 3.2. We have d(A, H) < A/a 2 for any \H\ > aN. 

Proof (of Claim 3.2). Since H > aN, by the er-regularity of R, for any v we have, 

\H\\B? H \ = e GR (H,H-v) = (1 + 0<r (l))\H\\H\\R\/N. 
Hence \A V H \/\H\ < \R V H \/\H\ < 2\R\/N < (2/a)\A\/N. As a result, 

d(A,i/)<l^(2/ a ) 2 <4/ a 2 . 



As in the proof of Szemeredi's regularity lemma, when a partition with too many 
irregular pairs comes into play, then we pass to a finer partition, and by so the 
mean square of the densities will increase. What we are going to do is similar, the 
only difference is we restrict ourselves to a special family of partitions. 

Lemma 3.3. Let e G (0, 1) and suppose that H is a subspace of V , which is not 
e -regular for A. Then there is a subspace H' < H such that \V/H'\ < (2p)\ G / H \ 
andd(A,H')>d{A,H)+s 3 . 

Proof (of Lemma 3.3). Since H is not e-regular for A, there are eN vectors v 
such that snp^ H ± \A V H (£)\ > e\A\\H\/N. In other words, there exists a positive 
integer m satisfying eN/\H\ < m < N/\H\ together with m coset representatives 
Vi, . . . ,v m € V/H and vectors £1, . . . ,£ m G H, where N/\H\ > m > e\N\/\H\, such 
that 



\A^)\>e\A\/N. 
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Now let H' C H be the annihilator of all £j's. It is clear that 



\H'\ > \H\/p m > \H\/p\ v ' H \ 

Hence, 



\V/H'\ < \V/H\p\ v / H \ < (2p)l y / H L 
Set S := N\H'\ 2 (\A\/N) 2 \H\d(A,H r ). It is obvious that 



v£V veV,h£H 

Notice that \A v + h \ = Ex £ h(^ + v)(x - h)H'(x) = J2 x eh(A + v)(x)H'{x + h) = 
\H\{A V H * H')(—h). We rewrite S and then use Plancherel's formula, 



S=\H\ 2 £ \A" H *H'(h)\ 2 

vEV,heH 

= \h\* Yl \a%^h'(o\ 2 

= \H\" J2 \^k(0\ 2 \H'(0\ 2 - 
In the last sum, the contribution of the £ = term gives 



S Q = \H\ 3 J2(\ A H\/\H\) 2 (\H'\/\H\f 

= \H\\H'\ 2 Y / (\A V H \/\H\) 2 
= N\H\\H'\ 2 (\A\/N) 2 d(A,H)- 

while the sums contributed from £ G -ff\{0} is bounded from below by 



^ >|ff| 3 £ E l2|te)l 2 l^'te)l 2 - 

i=l iiGH+Ui 

But since & 6 i?'- 1 , we have /?'(&) = Use the bound > e|A|/JV 

for all 1 < i < in, we obtain 
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S^>\H\ 3 m\H\(s\A\/N) 2 (\H'\/\H\f 

>\H\ 3 {e\N\/\H\)\H\e 2 {\A\/N) 2 {\H'\/\H\f 
= e 3 \H\\H'\ 2 N{\A\/N) 2 . 

From the estimate for So and S^o we conclude that d(A, H') > d(A, H) + e 3 . 



To complete the proof of Theorem 3.1 we keep applying Lemma 3.3. Since d(A, H) < 
A/a 2 , the iteration stops after at most As~ 3 aT 2 steps. During the iteration, \H'\ is 
always bounded below by N/W(4e~ 3 a~ 2 ), thus we may choose a — (2W(4 : e~ 3 a~ 2 ))~ 1 . 



Let us conclude this section by mentioning an important corollary of Theorem 3.1. 

Theorem 3.4. Let a, £ G (0, 1) and let m be a positive integer. There is a constant 
a = a(e,a,m) such that if R is a a-regular set of V and A is a subset of R of 
cardinality a\R\, then the following holds. Assume that A = U^l 1 A i is a partition 
of A into m distinct sets of size \A\/m. Then there is a subspace H <V of index 
bounded by W(4m 2 e~ 3 a~ 2 ) which is e -regular for all Ai 's. 

To prove Theorem 3.4 first we let d(A 1 , ...,A m ,H):= Y™=i d(Ai,H). Next keep 
iterating Lemma 3.3 if H is not e-regular for some Ai. Since d(A\, . . . ,Ai,H) < 
Am 2 /a 2 , the iteration will stop after at most Am 2 e~ 3 a~ 2 steps. 

4. Main lemmas for applications 
4.1. Regularity of a random set. 

Lemma 4.2. For a > there is a constant C(a) such that if r > C{a)N 1 / 2 and 
R is a random subset of size r of V, then R is a a-regular set almost surely. 

To start with, we consider a slightly different model as follows. 

Lemma 4.3. For a > there is a constant C(a) such that if r > C{a)N 1 ^ 2 and 
q = r/N , and R is a subset ofV whose elements are equally selected with probability 
q, then R is a a-regular set almost surely. 

Proof (of Lemma 4.3) Let X, Y C V, of cardinality at least aN. The number of 
edges of Gr generated by X and Y is 

e R (X,Y)= ]T l R (y-x)l x (x)l Y (y) = N 2 J2^)hc(0^(-0 

x, y ev £ev 



10 



HOI H. NGUYEN 



where the Fourier transform is defined with respect to V, and the latter identity 
comes from Fourier inversion formula. Thus we have 



e R (X 1 Y) = \R\\X\\Y\/N + N 2 £ 
Let us pause to estimate 1.r(£). 

Lemma 4.4. sup 5 _^ | < |J2|/(JV log AT) almost surely for R. 

The proof of this lemma is routine by applying the exponential moment method. 
For the sake of completeness, we prove it in Appendix A. 

Assuming Lemma 4.4, then by the Cauchy-Schwarz inequality and Parseval's iden- 
tity we have 

1/2 

\e R {X,Y) \R\\X\\Y\/N\ < iV 2 sup |Q0l ( £ IM0| 2 £ |1^(€)| 2 ' 



<iv 2 sup|Q(c)|(|A||y|/iv 2 ) 1/2 

= sup|l^(0|(|A||r|) 1 /2A r . 



On the other hand, as \X\,\Y\ > aN/A and sup ?#0 < \R\/(N\ogN), we 

have 



su V \r R m{\X\\Y\fl 2 = o(\R\\X\\Y\/N). 



completing the proof of Lemma 4.3. 



Next we show that the two models, of Lemma 4.2 and of Lemma 4.3, are similar. 

Proof (of Lemma 4.2). Let q = (1 — er 4 )|i?|/iV. We first consider a random set 
R\ by selecting each element of V with probability q. It is obvious that the size 
of this random set belongs to [(1 — 2cr 4 )|i?|, \R\] almost surely. We restrict ourself 
to this event by renormalizing the probability space. Hence the random set R\ is 
chosen uniformly from the collection of subsets of size [(1 — 2d 4 )|i?|, \R\\. Next we 
pick uniformly a set i?2 of size \R\ — \R\ \ from V\R\ and set R — R\U i?2- 
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Suppose that X,Y C V and \X\, \Y\ > aN. Since R\ is er-regular almost surely by 
Lemma 4.3, we have {l-o a {l))\Rx\\X\\Y\/N < e Rl {X,Y) < (l+ 0<T (l))\Ri\\X\\Y\/N. 
On the other hand, it is obvious that 



e Rl (X,Y)<e R (X,Y) 

<e Rl (X,Y) + \R 2 \N 

< (l + o a (l))\R\\X\\Y\/N + 2a 4 \R\N 

= (l + o a (l))\R\\X\\Y\/N. 

Hence e R (X,Y) = (1 + o CT (l))|X||F|/./V almost surely, completing the proof of 
Lemma 4.2. ■ 



4.5. Edge distribution of quasi-random graphs. Roughly speaking, if we choose 
randomly a large number of vertices of a dense quasi-random graph, then the chance 
of obtaining an edge is very high. This simple observation, as a strong tool to ex- 
ploit structure for counting, was used in [2], and will play a key role in our proof 
of Theorem 1.3. 

Let G — G(u,p,e) be an e-regular bipartite graph, V(G) = U\\J U 2 , where \Ui \ = 
I C/2 1 = u and d(G) = e(G)/u 2 = p. Let ti,t 2 < u/2 be two given positive integers. 
We select a random subgraph of G as follows. First, an adversary chooses a set 
Si C Ui with I Si I < u/2. Then we pick a set 7\ C C/i\Si with |7\| = t Y from the 
collections of all c^-subscts of XJ\\S\ with equal probability. Next, our adversary 
picks a set S2 C U 2 with IS2I < u/2, and we pick a set T 2 C U 2 \S 2 with T 2 = t 2 
from the collections of all t2-subsets of U 2 \S 2 with equal probability. Let us call 
the outcome of the above procedure a random (ti, £2)-subgraph of H. 

Lemma 4.6. [2, Lemma 11] For every constant < n < I, there exist a constant 
< e < 1 and a natural number uo such that, for any real t > 2{u/e) 1 / 2 and any 
given graph G = G(u,p,e) as above with u > uq and p >t/u, the following holds. 
If tii ^2 > t, regardless of the choices for Si and S 2 of our adversary, the probability 
that a random (ti,t 2 ) -subgraph of G fails to contain an edge is at most if. 

The proof of Lemma 4.6 is simple, the interested reader may read [2]. 

4.7. Roth's theorem for F™. Another important ingredient which will serve as 
a starting point for our argument is Roth's theorem. 

Theorem 4.8. For any 5 > there is a number c(S) > such that if B is a 
subset of V of size S\V\, then B contains at least c(<5)|V| 2 three-term arithmetic 
progressions. 



In the next section, we shall put every thing together to establish Theorem 1.3. 
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5. Proof of Theorem 1.3 
First, by Theorem 4.8, it is enough to work with the case 



r = o a (N). 

We say that a set A which belongs to some cr-regular set R is [a, a) -bad if \A\ = 
a\R\ = ar and it contains no nontrivial three-term arithmetic progression. Our 
main goal is to give an upper bound for the number of bad sets of a given size. 

Theorem 5.1. Let a and r/ be given positive numbers. Then there exist constants 
c = c(a) > 0,C = C(r],a) > and a = a(a,rj) > such that for all s > 
C(a,n)N 1 / 2 , the number of (a,a)-bad sets of size s is at most r] c ^ s ( r ^j. 

Proof (of Theorem 1.3 assuming Theorem 5.1). We choose r\ = r\(a) to be small 
enough. Let s > C(a,rj)N 1 ^ 2 and put r = s/a. Pick a random set R among 
all r-subsets of [n]. Then by Theorem 4.2, R is cr-regular almost surely. Among 
these cr-regular r-sets, by Theorem 5.1, the number of sets that contain at least an 
(a, a)-bad subset is at most 




Observe that, as 77 is small enough, this amount is o (("))• Hence almost all r-sets 
of [n] contain no bad subsets at all. To finish the proof, we note that if R contains 
no (a, er)-bad subset, then it is (a, 3^4P)-dense. 



We shall concentrate on proving Theorem 5.1 by localizing some properties of A. 
Our approach follows that of [2] closely, but the key difference here is that we shall 
exploit rich structure obtained from Theorem 3.1 and Theorem 3.4. 

Let R be a cr-regular of fixed size C^N 1 / 2 < r = o(N) such that A C R. Let 
m = m(a) be a large number to be defined later. 

From now on we shall view A as an ordered m-set-tuple, A — {A\, . . . , A m ) where 
\Ai\ = \A\/m for all i and A = LLAj. We shall choose e = e(a) to be small enough. 
By Theorem 3.4, there exists a subspace H of V which has index bounded by 
W(4:m 2 a~ 2 e~ 3 ) and which is e-regular for all Aj's . 

Let v\, . . . , Vk be representatives of the coset subgroup V := V/H. For each Ai, 
let us consider a set Bi of vectors v that satisfy the following conditions: 

• v is e-regular with respect to Ai and H. 



13 



• 1(^)^1 > (1/4)1^11^1/^. 

It is clear that \{Ai) v H \ < A V H < \R V H \. But by definition of R, \R V H \ < 2\R\\H\/N; 
thus we have 

E 1(^)^1 > 1^1 - (eK)(2\R\\H\/N) K((l/4)\A t \\H\/N) 
> (1-H/a- 1/4)1^1 > \Ai\/2, 

provided that e < a/2m. We infer that the size of Bi is large, 



\B l \>(\A l \/2)/(2\R\\H\/N)>—K. 

By a truncation if needed, we assume that Bi has cardinality (a/Am)K for all i. 
Notice that these sets are not necessarily disjoint. We shall show that there are 
many three-term arithmetic progressions (in V') with the property that all 3 terms 
belong to different iVs. 

Now we set B := {v € V : v G Bi fl Bj fl Bk for some i < j < k} and consider the 
following two cases. 

Case 1. \B\ > (a/8m)K = (a/8m)\V'\. Applying Theorem 4.8 we obtain 
c(a/8m)K 2 three-term arithmetic progressions in B. By the definition of B, it 
follows that there are c(a/8m)K 2 three-term arithmetic progressions with the prop- 
erty that all three terms belong to three different sets Bi. 

Case 2. \B\ < (a/8m)K = \B t \/2. We let B' = U^B^B. By an elementary 
counting argument, it follows that \B'\ > m|Bj|/4 = (a/l&)K. Let us write B' = 
U™ X B[, where B[ C B t and all B[ are disjoint. 

By Theorem 4.8, the set B' contains c(a/16)K 2 three-term arithmetic progres- 
sions. Among them, since each three-term arithmetic progression is defined by two 
parameters, the number of three-term arithmetic progressions that consist of at 
least two terms from the same B[ is bounded by ^Y^iLi \^i\ 2 - The latter quantity 
is bounded by 3|Bj|(^™ \^i\) < 3(a/4m)(a/4)if 2 ; which is negligible compared to 
c(a/16)K 2 by letting m = m(a) large. 

In both cases, the number of three-term arithmetic progressions with the property 
that all three terms belong to three different sets Bi is at least c'(a)K 2 , where 
c'{a) — min(c(a/8m), c(a/16)/2). By an averaging argument, there exist three 
indices io < jo < ko such that the number of three-term arithmetic progressions 
in Bi x Bj x Bk is at least c'{a)K 2 jm 2 = c"(a)K 2 . In particular, there exist 
a vector Vi G B io and c"(a)K pairs (Vj ,v l k ) G Bj x Bk such that each triple 
(v ia , v l j o , v l k ) is a three-term arithmetic progression. 
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Let us summarize what have been achieved. 

(1) There exists a subspace H of index bounded by a function of a and e, and 
there exist A io ,Aj ,A ko and triples (vi ,v l j o ,v ko ), where 1 < I < c"'(a)K, 
such that the following holds: 

(2) v io is an e-regular vector for A io , and (A io )^° > (l/4)\Ai \\H\/N; 

(3) (A jo )<>(l/A)\A jo \\H\/N=(l/4m)a\H\/N-, 

(4) (A k0 ) V H° > (1/4)1^11^1/1^1 = (l/4m)s\H\/N; 

(5) (v io , v l j a , v l ko ) is a three-term arithmetic progression in V/H. 

One also observes that v l j o , v l ko are e-regular vectors with respect to Aj and A ko ; 
but we do not need this fact. Since this configuration arises from [2], let us call it an 
(a,e,H,i , jo, ko,Vi ,v l j o ,v ko ) -flower. Roughly speaking, the reader may visualize 
a flower with a center Ai a +Vi DH, where Ai +Vi CiH sits nicely in H, and with 
c"'(a)K petals (A jo + v l jo n H, A ko + v l ko n H). 

We denote by S the collections of all ordered m-set-tuples A = (A\, . . . , A m ) of 
size \A\ = s. 

Proposition 5.2. Let a, r\ be given. Then there exist constants c — c(a) > 0, C — 
C(a,rj) and e = e(a,rj) > such that the number of ordered m-set-tuples A = 
(Ai, . . . ,A m ) of size s, where s > C(a,r))N 1 ^ 2 , that contain a flower but not any 
non-trivial three-term arithmetic progression is at most rf^ s \S\. 

It is clear that Proposition 5.2 implies Theorem 5.1. Hence we just need to prove 
Proposition 5.2. 

First of all we shall estimate the probability that a set A that contains a given 
(a, e, io, jo, ko, Vi , £>j- , v l k ) -flower but contains no non-trivial three-term arithmetic 
progressions. On this probability space we also fix Ai and fix the size of A l - o n H 
and A l kg n H for all I. Hence, v io is an e-regular vector with respect to a fixed sets 
A io and H; and the sets Aj and A ko vary in such a way that v l j o ,v l ko satisfy (3) 
and (4) respectively (in other words, Aj , A ko intersects H — v l j o , H — v l ko in sets of 
given size). 

Without loss of generality, we assume that 2vi = vj o + v l k(j for all I. We define 
a Cayley graph between H — uj o and H — v l kg by connecting v\ G H — ?;j o to 
v 2 £ H — v l ko if (vi + u 2 )/2 G A io . Since Vi is e-regular with respect to A ial by the 
observation made before proving Theorem 3.1, this graph is also e-regular. 

By choosing e = e(a, n) = e(a) to be small enough, and recalling that TV 1 / 2 <C 
s < r — o(N), we may check that for each bipartite graph (H — Vj Q , H — v 1 ,), the 
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assumptions of Lemma 4.6 are satisfied with S\ = Ui<m<j (^ m DC^ — u j ))' ^ = 

A J n(H-v l j0 ),S 2 = Ui< m<feo (^n(ff - and T 2 °= A feo n(^-<). It 

follows that the probability each petal fails to contain a there-term arithmetic 
progression is less than rji 1 / 4 ) 8 /^^ . Hence the probability that A contains no 
non-trivial three-term arithmetic progressions is less than rf 1 / 4 ^'' '(«W™ = ^'"(«) s _ 

Now we bound the number of flowers: the number of choices for H is bounded by 
jyW(± m 2 a 1/2 e 3 ) ^ tne nunU5er f choices for (io , jo> ^0) u i > v j ' w io ) * s bounded by 
K 4+2c"( a )K ( which is independent of A). Hence there are at most N c ^ flowers. 

Putting everything together, we infer that the number of A that contains some 
flower but not any non-trivial three-term arithmetic progression is at most 

Ar C(a) ?? c'"(a) S | (S | < v c""( a )s^ 

completing the proof. 

Appendix A. Proof of Lemma 4.4 

Without loss of generality, we just work with the real part of 1 R . We shall prove 
P_r(sup^ |Sftl;j(£) > A/A) = o(l) for some appropriately chosen A. Since the 
treatment for other cases is similar, we just show that P(3?1_r(£) > A/A) is very 
small for each fixed £ ^ 0. For convenience, put 

X = Nm R (0 = Yl lfl(")»eHv,0) := ]T X v . 

v£V vev 

One observes that A is a sum of A linearly independent real variables X„'s. Choos- 
ing t to be a positive number smaller than 1, we have 

Pr(X > A) = P R (exp(tX) > exp(U)) 
< E(cxp(iA))/cxp(tA) 

= ]jE(cxp(iA t ,))/exp(tA) 

= exp(tEA) JJ E(exp(f A„ - fE(A„)))/ exp(tA) 

= cxp(tEA) JjE(exp(ty„))/cxp(U), 

where Y v := A, - £(A„) = (l R (v) - q)Sle(-(v,S)). 

Notice that \Y V \ < 1 and < t < 1. We thus have exp(tF„) < 1 + tY„ + t 2 ^ 2 . 
Hence 
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E(exp(ty„)) < 1 + E(t 2 Y*) < exp(E(t 2 Y 2 )). 
Also, because EX = q$l^2 veV e(—(v,£)) — 0, it follows that 

P(X > A) < ]jE(exp(ty t ,))/exp(tA) < cxp(i 2 ^ E(r„ 2 ))/ exp(iA). 

v£V 

On the other hand, it is clear from the definition of Y v that J2 v ev ^O^v ) — qN. 
Thus 



P(X > A) < exp(t 2 g iV-<A). 

By choosing A = \R\/logN and t = \/(2qN) = 1/(2 log N) (thus t < 1), we deduce 
that 



P(X > \R\/logN) < exp(-|i?|/(21og 2 iV). 

Hence 



P(supKl s (0 > \R\/(N log N)) < Nexp(-\R\/(2\og 2 N)) = o(l). 
(Note that the choice for A above is not optimal, but it is enough for our goal.) 
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